Indexing Methods for Protein Tertiary and Predicted Structures
Author
Abstract
This thesis addresses fast sub-structure search and remote homology detection in proteins by finding similar (sub)structures. That is, given a query protein and a large database of protein structures, we want to rapidly retrieve all similar structures from the database. As the number of proteins deposited in the database grows, searching it becomes a difficult and time-consuming task. High-throughput proteomics methods are already accumulating the protein interaction data that we wish to model, but fast computational methods for database searching lag far behind; biologists need a means to search protein structure databases rapidly, much as BLAST rapidly searches sequence databases. We are interested in two main problems that arise in sub-structure and remote homology searches: protein tertiary structure indexing, and predicted structure indexing for proteins whose structures have not been determined experimentally. In our tertiary structure indexing approach, we present a new method for extracting local feature vectors of protein structures. Each residue is represented by a triangle, and the correlation between a set of residues is described by the distances between Cα atoms and the angles between the normals of the planes in which the triangles lie. The normalized local feature vectors are indexed using a suffix tree. For all query segments, the suffix tree can be used to retrieve the maximal matches efficiently, and these matches are then chained to obtain alignments with database proteins. Similar proteins are selected by their alignment score against the query. In our predicted structure indexing approach, a hidden Markov model (HMMSTR) of high sequence-structure local motifs (the I-sites library) is used to generate the feature vectors for the structure predicted for a given sequence. Remote homologous proteins are detected by using the suffix tree index over the predicted structures.
We test our algorithms on several real datasets. We improve both the time and accuracy perfor-
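The pipeline outlined in the abstract (per-residue triangles, Cα distances and normal angles, maximal matches, chaining) can be illustrated with a minimal sketch. Several details here are assumptions and are not taken from the thesis: the triangle is assumed to be formed by the backbone N, CA, and C atoms; features are discretized with hypothetical bin widths `dist_bin` and `angle_bin`; and a brute-force scan stands in for the suffix tree lookup, so it shows what is retrieved, not how the index achieves its speed. All function names are illustrative.

```python
import math

def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def residue_normal(res):
    """Unit normal of the residue triangle (ASSUMED here: N, CA, C atoms)."""
    v = _cross(_sub(res["N"], res["CA"]), _sub(res["C"], res["CA"]))
    length = math.sqrt(_dot(v, v))
    return (v[0] / length, v[1] / length, v[2] / length)

def local_feature(res_i, res_j):
    """Return (CA-CA distance, angle in degrees between triangle normals)."""
    d = _sub(res_i["CA"], res_j["CA"])
    dist = math.sqrt(_dot(d, d))
    cos_a = _dot(residue_normal(res_i), residue_normal(res_j))
    cos_a = max(-1.0, min(1.0, cos_a))  # guard acos against rounding error
    return dist, math.degrees(math.acos(cos_a))

def to_symbol(feature, dist_bin=1.0, angle_bin=30.0):
    """Discretize a feature into a hashable symbol (bin widths are hypothetical)."""
    dist, angle = feature
    return (int(dist // dist_bin), int(angle // angle_bin))

def maximal_matches(query, target, min_len=2):
    """Brute-force stand-in for the suffix tree: all maximal common runs
    of symbols, returned as (query_start, target_start, length)."""
    matches = []
    for qi in range(len(query)):
        for ti in range(len(target)):
            # Skip starts that could be extended to the left (not maximal).
            if qi > 0 and ti > 0 and query[qi - 1] == target[ti - 1]:
                continue
            k = 0
            while (qi + k < len(query) and ti + k < len(target)
                   and query[qi + k] == target[ti + k]):
                k += 1
            if k >= min_len:
                matches.append((qi, ti, k))
    return matches

def chain_score(matches):
    """Largest total length of a chain of non-overlapping matches that
    appear in the same order in the query and in the target."""
    matches = sorted(matches)
    best = []
    for i, (qi, ti, k) in enumerate(matches):
        score = k
        for j in range(i):
            qj, tj, kj = matches[j]
            if qj + kj <= qi and tj + kj <= ti:
                score = max(score, best[j] + k)
        best.append(score)
    return max(best, default=0)

# Letters stand in for discretized feature symbols in this toy example.
query = list("ABCXDEF")
target = list("ABCYYDEF")
print(chain_score(maximal_matches(query, target)))  # prints 6
```

In this toy run the two maximal matches ("ABC" and "DEF") are collinear, so they chain into a single alignment of total length 6. A real index would replace `maximal_matches` with a suffix tree built over all database proteins, and the scoring would follow whatever alignment score the thesis defines.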
Similar resources
In silico Analysis and Modeling of ACP-MIP–PilQ Chimeric Antigen from Neisseria meningitidis Serogroup B
Background: Neisseria meningitidis, a life-threatening human pathogen with the potential to cause large epidemics, can be isolated from the nasopharynx of 5–15% of adults. The aim of the current study was to evaluate biophysical and biochemical properties and immunological aspects of chimeric acyl-carrier protein-macrophage infectivity potentiator protein-type IV pilus biogenesis protein ...
Designing and analyzing the structure of Tat-BoNT/A(1-448) fusion protein: An in silico approach
Clostridium botulinum type A (BoNT/A) produces a neurotoxin recently found to be useful as an injectable drug for the treatment of abnormal muscle contractions. The catalytic domain of this toxin, which is responsible for the main toxin activity, is a zinc metalloprotease that inhibits the release of neurotransmitter mediators in neuromuscular junctions. A cell penetrating cationic peptide, Tat, ...
In Silico Analysis of Primary Sequence and Tertiary Structure of Lepidium Draba Peroxidase
Peroxidase enzymes are widely applicable in industry and diagnosis. Recently, we introduced a new kind of peroxidase gene from Lepidium draba (LDP). According to protein multiple sequence alignment results, LDP had 93% similarity and 88.96% identity with horseradish peroxidase C1A (HRP C1A). In the current study we employed in silico tools to determine to which group of peroxidase enzymes LDP...
Efficient protein structure search using indexing methods
Understanding functions of proteins is one of the most important challenges in many studies of biological processes. The function of a protein can be predicted by analyzing the functions of structurally similar proteins; thus, finding structurally similar proteins accurately and efficiently from a large set of proteins is crucial. A protein structure can be represented as a vector by 3D-Zernike ...
Spectroscopic study of the radioprotective effect of cerium salt on structural changes of bovine serum albumin protein induced by gamma radiation
Ionizing radiation such as γ-radiation causes deleterious effects on proteins. In this research, the effect of gamma radiation at the therapeutic dose of 3 Gy and the radioprotection of cerium salt (H8N8CeO18) on the structure and surface charge of bovine serum albumin (BSA) were studied. The primary, secondary, and tertiary structure and surface charge of BSA using UV-Vis spectroscopy, Circular dichroi...
Journal title:
Volume, issue
Pages -
Publication date: 2006